RGame: Embodied Gaming for Robot Learning by Demonstration
نویسندگان
چکیده
We here demonstrate robot learning from demonstration using interactive tutelage. Our Dogged Learning architecture (introduced in (Grollman and Jenkins 2007a) and shown in Figure 1) combines concepts of teleoperative demonstration, mixed-initiative control, feedback and transparency, and real time policy inference. It is designed to be abstract, applicable to many platforms (robots), demonstration interfaces, and learning algorithms. Briefly, a robot platform provides perceptual inputs to the system, which are displayed to a demonstrator, as well as analyzed by a learned approximate policy. Both the demonstrator and the learner generate desired action outputs, as well as confidence values. The outputs can be NULL, and confidences 0, if either is not present. Using the confidences, the system arbitrates between the two outputs by, for example, taking the more confident output. Results of arbitration are displayed to the user, and the chosen action is passed to the robot, which performs as commanded. Additionally, if the commanded action is from the demonstrator, the action is paired with the perception that led to it, and the pair is used by the learner to update its policy approximation. The DL architecture is embedded within the RGame program, which allows for users who are not co-located with the robots to generate demonstration data for them. Further, the system collects data for learning, and manages the learned policies. It is a goal of our work that remote users can log in, train robots to perform tasks, and then continue to monitor and improve their performance over time.
منابع مشابه
Training a Robot via Human Feedback: A Case Study
We present a case study of applying a framework for learning from numeric human feedback—TAMER—to a physically embodied robot. In doing so, we also provide the first demonstration of the ability to train multiple behaviors by such feedback without algorithmic modifications and of a robot learning from free-form human-generated feedback without any further guidance or evaluative feedback. We des...
متن کاملWorkspace Boundary Avoidance in Robot Teaching by Demonstration Using Fuzzy Impedance Control
The present paper investigates an intuitive way of robot path planning, called robot teaching by demonstration. In this method, an operator holds the robot end-effector and moves it through a number of positions and orientations in order to teach it a desired task. The presented control architecture applies impedance control in such a way that the end-effector follows the operator’s hand with d...
متن کاملSelf-Organized Flocking with Agent Failure: Off-Line Optimization and Demonstration with Real Robots
This paper presents an investigation of flocking, the formation and maintenance of coherent group movement, by teams of autonomous mobile robots using principles of Swarm Intelligence. First, we present a simple flocking task, and we describe a leaderless distributed flocking algorithm (LD) that is more conducive to implementation on embodied agents than the established algorithms used in compu...
متن کاملRoboPatriots: George Mason University 2012 RoboCup Team
The RoboPatriots from George Mason University are a team of three humanoid robots. As we are interested in embodied AI, our robots are based on commercially available equipment. We use the three on-board computers for research into learning from demonstration, a supervised machine learning technique for training robot behavior. RoboCup 2012 marks the fourth year of participation for the RoboPat...
متن کاملThe Maryland Virtual Demonstrator Environment for Robot Imitation Learning
Robot imitation learning, where a robot autonomously generates actions required to accomplish a task demonstrated by a human, has emerged as a potential replacement for a more conventional hand-coded approach to programming robots. Many past studies in imitation learning have human demonstrators perform tasks in the real world. However, this approach is generally expensive and requires high-qua...
متن کامل